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HYPER-TEXT DOCUMENT FORMATTING COLLATING AND PRINTING 

Field of Invention 

The present invennon relates to hyper-tex. MM -* * l-*"*" W ** 
network access, formatting and printing of hyper f*xt documents. 

Kaekground of the Invention 
Many compu^ W document matk-up IW have «• «- «*• 

„ a,,ow corroded document prepataUon. Example, of such Ungual 
TROFF TcX. RTF. as «0 » many I-**-* •-»» — ^ 
^ lord process applications. These tnerk-up *«-•- are designed to aiiow 
, 0 tll , computer assisted preparation of . document defined foe printing. As- . 

. ^valfsncc and active nature of digital 
consequence to these developments, the prevalence anu 

concurs r^encouranedtnemrr^^^ 

A iyper-U* is a pointer. typically embedded in a document, that provides a 
oirea ft* to another portion of the same document. ~d« decuman, another 
,. resource. avails on the cur™, nerwo* nod. or another-networlt «*. Hype,,**, 
A „ often used on the Internet, and in parucuiar th. Wo„d Wide wet, to II* a 
document at one Web sit* with a document at another Web stte. 

Hypc-linte are «* optional *• a document is vewed on-line, and no, 
M document is in prtored form. The increased v,„c of *«s. on-line »ype,te*. 
^^^causedawca^g^dteP-o-'^ 01 -^- Newton 

" C^s used — — - — * * "J 

r^Te» Mart-up T^uaee,. have few feaunes tu supr-r, *. descripnon of dwtr 

^form. More^.^-^--^^; 

P .. r ™.md by their authors in a manner which 

foron-Unevie^n^rheseoOCumetM-elorrnatnul^OK 

„d«. for screen viewing, and not necessarily for viewing in pruned form. 

mK *d in — — on^ne service, * *— which are ^P™le*r 
based viewine. but not « appropriate for - P«— *- *«* 
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because printing is not a focus of applications which access these hyper-text documents 
(thai is. hypertext browser applications), their printing facilities are generally poor. 
Common problems encountered when printing hyper-text documems include: 
. information is broken up into small hyper-tew documents, and many documents 
need to be collated to form a desired body of information; 

. text is formatted with fewer words per line dun » common for printed paces, and 
in general the density of information is less than is typical for primed pages; 

. hyper-text document viewiu* programs are document-centric, that Is they operate 
on a single hyper-text document « a time, which results in Utfc being the unit of 
printing, resulting in much tepetitive work by the user to print a set of linked hypeMwi^ 
documents, and typically no more than one hyper-tcxt document on each printed page^ 

. hyp«-tcxt document viewing programs generally do not print aJUhe feature, of 
hyper-text pages which are displayed on-screen (a display device).^ particular die 
target of hyper-links is often not included in printouts. 

It is possible for die provider of a hyper-rcxt document designed for screen 
viewing to also provide substantially the same document in a different form designed 
for priming, but this requires double handling by the document provider. It also often 
results in significant differences between die screen version of the document and the 
printed form. 

The problem of no mate than one hyper-text document per printed page can 
sometimes be addressed by the reduction and rotation of die image of each basic page 
and printing each .educed page image on. say, one half of a printed page. However 
^ method does nor save paper at a uivcu scale. For example, if , large number of 
S mall hypcr-texe documents are printed, each of which only occupies 25% of a printed 
(physical) pa*, even diough the documents are photo-redu^d and primed two per 
physical page, each physical page still has 75% blank space. Further, this method does 
nm ul ovide continuous page-length columns. Continuous coluuu. printing P rov ld e S 
improved readability and space utilization. 



An object of toe invention is to substantially overcome at least one of the 
aforemeniioucd problems in the formatting of hypertext documents. 

Summary of the Invention 

In accordance with one aspect ot the present inveniioa there is provided a method 
of collating hypcr-text documents comprising the steps of: 

(a) monitoring a user's access patterns » j£hypcr-te* documents; 

(b) accessing^ hyper-text documents including structure information of the 

Qnd. 

accessed hyper-teit documents^ 

( C) creating a formatted version of the accessed hyper-text documents for; 
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In accordance w,rh another aspect of the present invention there is provided a 
method uf collating hyper-text documents comprising steps of. 

(a) accessing^ hyper^t documents including structure information: 

(b) creating a formatted version of^acccssed hyper-text documents wherein 
$k formatted version is characterised by a single or multiple column printing such that 
"each printed page contains « many ojSfffcypcMca document, as can reasonably f« 

in an available space on a printed page. 

Other aspects and feature* uf the present invention are also disclosed. 
Brief Description of the Drawings 

A preferred embodiment of the present invention will now he described with 
reference to the accompanying drawings in which: 

Fig. 1 is a block diagram showing the operating environment of the preferred 

embodiment of the present invention; 

Rf . 7 show* the visual appearance of a user interface in accordance with the 

preferred ernbodimcnt. 

Fig . 3 is a block diagram of an internal structure of the preferred embodiment of 

the invention; 

Ffc. 4 is a block diagram of a genera, purpose computer upon which the preferred 
embodiment of the present invention can be practiced; 



*T~> — ^ n ° f a ^ ducutncnl " 

portion of ihc preferred embody. 

Description of the Preferred Embodunent 

3 ^ , pnf : nvent ion is described as a computer 

-frrred embodiment of die present invention 
, THc prefer™, em. _ wiod ^ otcrjliMjy)ra n develops ft yMic tUS oft 

\ However those skilled in die art wm^-reewn" 
\ Coloration. However. 

\ - implemented on computer -ysoms nu> 

embodimek may be implem«n ^ 
\ , l_ referred embodiment can be perrornwu 

" The application program Uas a user Interface 

u pcr»o M . TtaW*- proven, Has the * ^ 

• k ,«.Lv connect* to a host computer or a=^««« » 
printers either tacetft connecicd 

device 11 C&r viewing nyp computer system 

\ • imolememed on a general purpw » 

»** ^ ltd NETSCAPE TUe 

0. - *W< J" fom , an wH.cn — a 

networlc system 12 



to"'"* ' «_ 14 prrferabiy implement'* » a «>ft»are 

A ^t «— fctn— ^ a „. ■ 

../lUn- in P- * **- 15 *" 

A 

.oAciua*.^*-*^ 01 *""™ 5 

ihtuus OT0S68Ua 



rcpondiag * a ~" *• "H"'-** "" CUmeM '* 

-* — * 12 » * ° f ' nC ' Udta8 

form**™. «* -W* *— "■ «* < " Ka ton " ;llOM *" 
text browser 10 or the user. 

Mm to FH. 2. «*« k *~. a user tart- ° f - 

embodiment » • «- **• M " 4 ""^ * "™* 

ccntro. area «. • «» - 1 »* " ™ Ptinl 

d u P ,.v 22 m. . n« - PH- i— «a- «• -* * whi5h * p ;" 
iKm 1* bcx :4. a «h. *— «• « f * M a - » fCKh — 26 

,nd a .ocation « «" »• * P* «« 22 *™" he PtiM ^ M 

ar* scrollable by means of scroll bar contioli 28 awl 29. 

V. print pft*. oispuy 23 dismays W-4 reckons of * pr.nu* p*. 

* are to be P.— - - P™«< «"»« 13 ^ t l 
op tio„s, for «.mple l« • WSfWYO <■*. yc, - K vou S eO forma.. The 
„ is free . se.ee, from *e menu anu con^oU 2. a prim — "» * ^ 
prim option. Sucb>t option can bd* pri* sento for ft. print output device . 
po™. or UuAcape o^on of paacs. pri» resound «-» <*« 
<e,ec«on of . option. *. cone* print prev,e- display 23 is appropriate* updated^ 

• .i - ™„r n«view display 23 Is regenerated automatically 
However the display in Oie pnnt preview a»pu»jr 

0 current .pp.ica.ion «. cban.es without U»™ r^ulr* by .be user. AopUeauon 
!HKS „«, en effect tbe print pre.- display 23 incWc. but are no. »** <o. d< 

en™™., P— • * — *— - S T 

u «f m,rv«l items in a prim list (ie. those selected by a 
can beset by *c operator, the set of marked items in P 

„ , . _ mirk box 24) ^ the order Of marked items associated with the 
mark in the print item nwle box m» » 

25 P "° t 1,St ' . . . ^ Dractised using a coavcutional 

-me preferred embodiment of the uivcnuon can be pracu* 

^ fhosrt computer system, such as dxe computer system 40 shown In 
ffeneral-purpuse (host) compuia y 

Aic„,«~i above and co be described with 
Fi„ 4 wherein the application program discussed above 

• ; i^-nted as software executed on the computer 

reference to the other drawings is implemented as so 



such » a ^ « - — 43. ^. **» Uik. a pd-r U,»d . 

1*,,, * • *> " ^ ^ 

ca „ u used « oman ******* I— • - — 

p. concur ^ « M b*- - ° M PraCK50r " nit * 
^ «* «.. fcr c^U ^ — . — 
(RAM) .* - «* (KOM, <W (W» *— ~ * 7 

X1tkdrive5 4 a CD-ROM drive 55 is typically provided as 
disk drive 53 and a floppy disk drive 54. a * 

, rcc of da* The components 45 to 49 and 53 to 55 of the computer 

a non-volaule source of data, ine«.vun~ 

- M s. .„ interconnected bus 50 and in a manner which 
module 41. tvpically communicate via an interconnect 

-u, Caircstations or alike computer systems, 

include IBM-PC/ATs and compatible*. Sun Sparcstation- 

T " lux. - - * — is °" * ^ 

: » — — - - — - 45 ■ — — - z 

„ Ac prosn<» «"1 <~ P™» ^ 

IT- • CD-ROM „ _ «, — ** - * — - - - 

- * 3 — 1 "* ^ 30. . — S u* 31. 

^ . embod imervt, which comprises a user interface 
picfcrred OTboaim«u, MO raae 34. the pntw 



The internal prim list storage 34 is structured as a list of records in the 
memory 46 of the general purpose computer system 40. each record being referred to 
hereinafter as a "print item". Each print item represents ar teas: one hyper-text 
document, and comprises a Uniform Resource Locator (URL) by which the assorted 
hyper-text douutuent can be retrieved as well as a further list of record,, each of which 
is referred to herein as a sob-tern. Each sub-item represents a distinct file-like unit of 
data which is required to complete the formatting and displaying of the hyper-tcxt 
document associated with the print item. Th«e of data (or sub-items) arc most 
commonly hyper-text documents in HTML format and images in GIF or JPEG format. 
Each sub-item records a file name within the temporary file ^ragc where the unit of 

data will be. or is. stored. 

In Hg. 3. the four tasks 30. 31. 32. 33 are shown, each of which is implemented 
as a separate Uucad within a single application process. The internal print list 
storage 34 is shared by the tasks 30-33 in a manner to avoid conflicts. Each task 30-33 
gains access to the print list on the internal storage 34 by first obtaining a lock 
(mutually exclusive lock). Once the lock is otaiined. the ta.sk teads- and possibly 
modif.es the print l«t and then releases the lock. Upon release of the lock, if change, 
were made to the print list, messages are forwarded to the user interface task 30, the 
formatting task 33,and the data fetching task 32 » inform them that changes have been 
made. 

The user interface task 30 performs user interface operations by having a waiung 
state 30A and by acceptance of user interface events, «ch as clicks and movements of 
the mouse 43. responds to process 30B as appropriate to each event. Operation of the 
«sk 30 is achieved by a message loop structure processing each opening system 
senerated evenr in norland is linked to the print list display 22. 

The monitoring task 3, performs mooicoring 3LA of user initiated access to 
documents including hyper-text documents uslnz the hya«« browser 10. and 
entering ilB each such document accessed by the user into the print list. In pamcular. 
the browser 10 includes an application program interface (API) which allows v«w iug 
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of Wcrtnauon *m ^ by *. *-er <* this — «• - — "* ' 

^ 10 . *. — . *• »- - 34 * — «~ d ; m °" ™ . 

* «. hem, .ocata, The user is then able to - - *<* '<« - * *— « — 

iwrnsnoc required to be printed. 

» *~ — . « — - — - r; s :i 

- ^ 4.„ necessary for producing a visually pleasing 
. . a i ona «,jth associated data necessary ior v 

* ^ 'T3or r«»o* access technique* * * 

Inwrnel protocols and/or newonc 

and a wait st«e 3*A for *«*. -» — » - - ^ * 

sysiem *» .... CM rinc the daw in a lemporaiy 

fcttMnt 32B. for ft**, - ■*«- " ^ task „ „ , K0 

«... « rvokallY formed within the memory 46. Tr* teicrau, 

responsible for uuna.mg W •»** - jre „ yp „. 

^ nrfnc list or addins to che prim Ust hypcr-wxt pig , 
Amending the print list or ««« . . „ .. tvnicaliy 

• 1C1V fetched hv the fetching task 32, »« tvpicauy 

«~ « °' rwX^ ,0. Hyper-links previous* 
^nned as , background the hyper .ex* b 

visited fetch™. 32 are prefer*,, « - « ^ 

„ect. as pari of optional s^dngs that the farfun, usk 32 
. user uu, eiecr, as pa to , ^ 

predetermined number o, byper-Unk pages for ^ 
Preferably, the ffcehing task 32 provides a cross reform 

^nrion. which maiuralns a cross rete.encms <o UtU. -P 
u ^ Bte e I ordes« S ueh,op B or,.h K hr»u«a ^ 

.. ,. , k «««.ieM documents to Be pnwed (fonnart*! versuru 

tab of ^^^^^^^^i^docun.er.tobepr^. 

!5 ^ references and . «"*-^ a ^ of condors iisrins each 

„ ^ connection, fc , hc ^ 

««.c«,f«l in the document to be printed, cacn cu» j 
...pcr-tcx. docuu-r, represent « the ^ 

„, content is l^Ued win, *e posmou t» n^ber) 
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The focinatting task 33 performs formatting of all documents which arc listed in 
the print list in a manner suitable for printed output, and also A optional ly showing a 
preview of the primed output which would be produced in the print preview ar«a. Its 
operation Is achieved by a recursive descent HTML parser and formatter, and results 
from waiting 33 A tnr a change in die print list, and a format stage 33 B which formats 
xhc documents and forward^loa printer interface 37 for hard copy reproduction. 

Notwithstanding that the updating of the prim preview display 23 appears, under 
some circumstances, to depend on ^availability of a hyper-text document through the 
network, a substantial portion of the rasks described with reference to Fig. 3 are 
performed substantially instantaneously in^fiackground mode unbeknown or at least not 
immediately apparent to the user. Typically, the tasks 30-33 can be performed 
synchronously or asychronously with a user s access pattern. Usually, a user accesses 
or visits, with the aid of the browser application, root hypertext documents. Described 
in an alternative way. hyper-text documents visited by a user arc referred to herein as 
root hyper-text documents, and any further hyper-links and their associated documents 
are visited and fetched by the fetching task 32 respectively. The depth to which hyper- 
links arc followed in fetching hyper-t«XT documents is user defined. Preferably, all 
hyper-links of a root hypcr-tcxt document having predetermined characteristics are 
visited by the fetching task 12 and the associated (hypcr-text) documents are retrieved. 
For example, a user may mark hyper-links to be followed to a predetermined depth or 
the user may specify characteristics of hyper links, and their associated documents, to 
be all documents descenrtent from a current root hyper-text document containing 
a 

^predetermined keyword. 

Fig. 5 provides an illustrative representation of th* preferred embodiment use. 
Fig. 5 shows a display screen 60 of the display 11 which has rwo windows clearly 
displayed. A window 70 is a web-browser application window thai displays a text 
document 67 (corresponding « a few of the introductory paragraphs of this patent 
description). This forms a background window and is reprcscntat.vc of the hyper-icxt 
browsci application 10 covering the entire screen aiea. Superimposed on top Of rhe 
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window 70 Is a window 63 eroding » •»°*i»g " * e i """'" :ati, ' n 

program of the preferred embodiment, bribed csHier with reference to Fig. 2. The 
user in d* case u preparins a document »™ed (torn three source, each mentioned ,n 
*. otto. display list «. A to. source 68. eau«d FRED, is « staple text source 
previous encountered during a Web teview, and occupies a tta » *<= 

document beiu* formed. A second source 69. beins a picture of . vehicle, occupies a 
second posmon. - a third source, correspond** to the ground te„ 
document 67. occupies die third position. It b seen *« «- *f" I* that a 
Se,rch used to locate the text document 67 has been deselected (N-No, from 

displav and hence does to. appear in the WYSIWYG print previa 65. The display b. 

indicates - -* fMCM U " S C0 " eiP, " ,di " ,S ^ " KlKKd 

<Y.Yes, for display. In each cas«,the location identifier provides the Web site address 

for the source material. 

As seen in Fig. 5. the second column 64 of the print preview 65 has a Ma.* 
section 66 As seer, from the print display list 61. the text document 67 tomato* to a 

-fetchin*- state, where m « b *™ - fo^ » attt,1 ** WYSWY ° 

ai<p uy." Once this is completed, the s~ion 66 displays the text that has since been 
fetched and the print display list 61 is updated - Wieate a -fetched- Stan, tor that 
document- 

IT. compilte the print document, the application program, and in particular, the 
document forma- 33B. recuses that the width of FRED and «. picrure are 
^rtower than the page, and therefore «uh»sh«s a column corresponds to then- 
width Because of its lenph. the «« document 67 is tbrmatted.^ into a narrower. 
lcft hend column 62 re,a,cd to the width of FRED 68 and the picture 69. and then to 
, flo w into the r*. hand column 6a which Is ad^tcd to a widd, to substantially «n the 

• «„ B «m ■« configured to automatically detect the 
pape Importantly, the application program is ennngvrea 

« incorBoraie that content into the prim previcvr 
selected content of a source, and to incorporate 

display » C«> i" an eeonomtc, manner so that a, « hyper-text documents « can 
reasonably be fined to a page ean b. display*. IMS reduce, paper consumptton. 



T,,e preferred embodiment is confer* « opiate in background mode whilst 
U,. W r is .raversint: the W«id Wide Wei, .0 automatically create and forma! a 
pti«tab.c document repress . chronological H of the user's traversal of .he 
world Wide Web. Typically, .he preferred crimen, operates m a ground 
mod , „ . wmdow operadog behind a web orowser A. seen in Fi e . 0. a 

flowchart of pr««*"« '~ <" *= .hypcr-tcx. doom.™. formatting, porrion of the 
preferred embodiment commence* at a starti» S point 102. This en.ry point M to . 
SEp lM where the application attend » «• an HTML dement front a Web 
document currently bein 2 viewed using a Web browser program. M step 106. when 
foUows step 104. an assessment of data availability b made and if none is available, 
step 108 assesses whether or no. another document can be opened. If so, control « 
returned to step 104 for bandhng the new document. If «. documen. hmrfef .s 

completed at step 110. 

,f data is available at step 106. control is passed to step 112 where the HTML 
element of the current Web si* location is formatted into a standard form able » be 
primed usmg a. appiicadon program. At step 114. an assessment is underrate as to 
Werner or not the formatted element is able to fit on to me page to be printed. !f so. 
control .s transferred . step HI where the fon.ur.ed HTML document is emitted « » 
oorpu. document. If 0. formatted element does not ft on to the page as determ.ned by 
step 114, control is passed .0 step 1M ««* « 
binder of the formal dement. This enab.es cont,o. to be passed to step 1.8 for 
crateillg „, the remaining formal HTML document. After step 118. courro, u< 
passed to step .50 which assesses whefcer or no. .her. is a remainder, for 
over from s«p 11*. If so. c««ro. I* -ensrrcd .o s.ep IM - *- *e — - * 
formatted and processed in ft. matmer desert above. If mere is no remainder, 
control is reojmed to step 104 iu order 10 read the next HTML element. 

With the .rr.nc.rnen. described t, *. 6. whilst the user browses the World 
Wide weh. ft. applicauon program continually assesses the data being viewed in .he 
w window and automatic..,, formats *« dau. into a continuous prints 
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document displayed in the window A Hfof!e^ptShowft in Figs. 2 and 5. When the user 
has completed broking, the window of (he application program (ie. window 63 of 
Fig. 5). car. he selected. Using the print display list 61. the user can either .select or 
deselect certain documents located during the Web browsing session for priming. 
During the course of a browsing session, ,11 documents seen are automatically enabled 
in the print document window. Accordingly, prior to primi* all that is necessary to 
for the user to cull out or deselect those components not desired for printing. For 
example, if the user had made use of a search engine during the Web browsing session, 
chere may be lirde point in printing out the text associated with that search engine. AH 
Hat would be nectary to pnnt could he the actual document or Weh site tectum 
found as a result of the search, such as shown in the example of Fig. 5. 

A funher advantage of the present invention is that, in die printed document, at 
the completion of each section relating to an individual Web location, the actual Web 
.cation is printed onto the .tinted — so^e n,r has a permanent hard 
Clipy record of not only the intomation|&^ of the location of that source. 

The foregoing only describes one embodiment of the present invention, however, 
modifications and/or changes can be made thereto by a person skilled in the art without 
departing from the scope of the invention. 



